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^ ! Abstract 

We consider a network of single-antenna sensors that observe an unknown deterministic parameter. 
J> " Each sensor applies a phase shift to the observation and the sensors simultaneously transmit the result 

: to a multi-antenna fusion center (FC). Based on its knowledge of the wireless channel to the sensors, 

qq ■ and feeds this information back to the sensors. The use of a phase-shift-only transmission scheme 



the FC calculates values for the phase factors that minimize the variance of the parameter estimate, 



■ provides a simplified analog implementation at the sensor, and also leads to a simpler algorithm design 



and performance analysis. We propose two algorithms for this problem, a numerical solution based 
on a relaxed semidefinite programming problem, and a closed-form solution based on the analytic 
constant modulus algorithm. Both approaches are shown to provide performance close to the theoretical 
bound. We derive asymptotic performance analyses for cases involving large numbers of sensors or 
large numbers of FC antennas, and we also study the impact of phase errors at the sensor transmitters. 
Finally, we consider the sensor selection problem, in which only a subset of the sensors is chosen to 
send their observations to the FC. 

Index Terms 

Wireless sensor networks, analog sensor networks, distributed beamforming, phase-only beam- 
forming, sensor management 
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I. Introduction 

A. Background 

Wireless sensor networks (WSNs) have been widely studied for detection and estimation 
problems. Recently, considerable research has focused on the fusion of analog rather than 
encoded digital data in a distributed sensor network to improve estimation performance. The 
advantages of analog WSNs have been established in [QQ|-[[3]|, where it was shown that when 
using distortion between the source and recovered signal as the performance metric, digital 
transmission (separate source and channel coding) achieves an exponentially worse performance 
than analog signaling. A number of studies have focused on algorithm development and analysis 
for analog WSNs with a single- antenna fusion center (FC). In flU, the sensors amplify and 
forward their observations of a scalar source to the FC via fading channels, and algorithms are 
developed to either minimize estimation error subject to transmit power constraints or minimize 
power subject to estimation error constraints. The scalar source model for this problem was 
generalized to correlated vector sources in [[5]|. An opportunistic power allocation approach was 
proposed in BH, and the scaling law with respect to the number of sensors was shown to be the 
same as the optimal power allocation proposed in 01. In 0, the asymptotic variance of the best 
linear unbiased estimator of an analog WSN is derived, together with an analysis of the effect 
of different assumptions regarding channel knowledge at the sensors. Scaling laws with respect 
to the number of sensors have been studied in [8] for a diversity-based method (where only 
the sensor with the best channel transmits), as well as for the coherent multiple access channel 
(MAC) and orthogonal channel cases, assuming a Gaussian source. In flU, a power optimization 
problem was formulated to minimize the outage probability of the MSE for the coherent MAC 
channel. More complicated settings involving analog WSNs with nonlinear measurement models 
ifTOl or relays [fTTI . Ifl2ll have also been studied. 

The results described above all assume that the FC is equipped with only one antenna. Just 
as multi-antenna receivers can provide significant capacity or diversity gains in communication 
systems, the estimation performance of a WSN should also benefit from the use of a multi- 
antenna FC, though prior work on this scenario is limited. A general scenario is investigated in 
[fT3l . involving vector observations of a vector-valued random process at the sensors, and linearly 
precoded vector transmissions from the sensors to a multi-antenna FC. Optimal solutions for the 
precoders that minimize the mean-squared error (MSE) at the FC are derived for a coherent MAC 
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under power and bandwidth constraints. In [fT4ll . single-antenna sensors amplify and forward their 
observations to a multi-antenna FC, but it is shown that for fading channels, the improvement 
in estimate variance is upper bounded by only a factor of two compared to the case of a single- 
antenna FC. The performance of two heuristic algorithms for choosing the gain and phase of 
the sensor transmissions is also studied. In another paper lfT5l . the same authors showed that 
there is a limit to the improvement in detection performance for a multi- antenna FC as well. 

The term "amplify and forward" is often used to describe analog sensor networks like those 
discussed above, since each sensor applies a complex gain to the observation before sending 
it to the FC. For a coherent MAC, one can think of this as a type of distributed transmit 
beamforming, although it is distinguished from distributed beamforming applications such as 
those in communications since in a WSN the observed noise is transmitted together with the 
signal of interest. Some prior research in radar and communications has focused on scenarios 
where the beamformer weights implement only a phase shift rather than both a gain and a phase. 
The advantage of using phase shifting only is that it simplifies the implementation and is easily 
performed with analog hardware. Phase- shift-only beamformers have most often been applied to 
receivers that null spatial interference lfT6l . ifPTll . but it has also been considered on the transmit 
side for MISO wireless communications systems lfT8l . which is similar to the problem considered 
here. For the distributed WSN estimation problem, phase-only sensor transmissions have been 
proposed in lfT9l . where the phase is a scaled version of the observation itself. 

In addition to the work outlined above, other WSN research has focused on sensor selection 
problems, particularly in situations where the sensors have limited battery power. In these 
problems, only a subset of the sensors are chosen to transmit their observations, while the others 
remain idle to conserve power. The sensor selection problem has been tackled from various 
perspectives, with the goal of optimizing the estimation accuracy [fTTI . Il20l . [12711 or some heuristic 
system utility |[22~1|. Il23l . In [20J, the authors investigated maximum likelihood (ML) estimation 
of a vector parameter by selecting a fixed-size subset of the sensors. An approximate solution 
was found by relaxing the original Boolean optimization to a convex optimization problem. A 
dynamic model is used to describe the parameter of interest in [|2T|. and sensors use the Kalman 
filter to estimate the parameter. At each time step, a single sensor is selected and the measurement 
at the selected sensor is shared with all other sensors. A numerical sensor selection algorithm 
was proposed to minimize an upper bound on the expected estimation error covariance. Instead 
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of the estimation accuracy, a utility function that takes into account the measurement quality or 
energy cost can also be used as the metric for sensor selection. In 11231 . each sensor independently 
optimizes its own operation status based on a utility function which depends on the sensor's 
own measurement and the predicted operation status of other sensors. A threshold is then found 
to enable the sensor to switch its status for either energy efficiency or energy consumption, and 
a power allocation algorithm was proposed to minimize the MSE at FC. 

B. Approach and Contributions 

In this paper we consider a distributed WSN with single- antenna sensors that observe an 
unknown deterministic parameter corrupted by noise. The low-complexity sensors apply a phase 
shift (rather than both a gain and phase) to their observation and then simultaneously transmit 
the result to a multi-antenna FC over a coherent MAC. One advantage of a phase-shift-only 
transmission is that it leads to a simpler analog implementation at the sensor. The FC determines 
the optimal value of the phase for each sensor in order to minimize the ML estimation error, and 
then feeds this information back to the sensors so that they can apply the appropriate phase shift. 
The estimation performance of the phase-optimized sensor network is shown to be considerably 
improved compared with the non-optimized case, and close to that achieved by sensors that can 
adjust both the transmit gain and phase. We analyze the asymptotic behavior of the algorithm 
for a large number of sensors and a large number of antennas at the FC. In addition, we analyze 
the impact of phase errors at the sensors due, for example, to errors in the feedback channel, a 
time-varying main channel or phase-shifter drift. We also consider a sensor selection problem 
similar to that in ll20ll . and analyze its asymptotic behavior as well. Some additional details 
regarding the contributions of the paper are listed below. 

1) We present two algorithms for determining the phase factors used at each sensor. In the 
first, we relax the original problem to a semidefinite programming (SDP) problem that can 
be efficiently solved by interior-point methods. For the second algorithm, we apply the 
analytic constant modulus algorithm (ACMA) ||24l|. which provides a closed-form solution. 
We demonstrate via simulation that the performance of both of the proposed algorithms 
is close to the theoretical lower bound on the estimate variance. 

2) We separately derive performance scaling laws with respect to the number of antennas and 
the number of sensors assuming non-fading channels with path loss. For both cases, we 
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derive conditions that determine whether or not the presence of multiple antennas at the 
FC provides a significant benefit to the estimation performance. That there exist non-fading 
scenarios where multiple antennas significantly lower the estimate variance is in contrast 
to the pessimistic results obtained in Ifl4l . |fl"5l for channels with fading. 

3) We conduct an analysis of the impact of phase errors at the sensors assuming relatively 
small phase errors with variance ^ < 1 (square-radians). In particular, we show that the 
degradation to the estimate variance is bounded above by a factor of 1 + of. 

4) We consider the sensor selection problem separately for low and high sensor measurement 
noise. For the low measurement noise scenario, we relax the sensor selection problem to 
a standard linear programming (LP) problem, and we also propose a reduced complexity 
version of the algorithm. For the high measurement noise scenario, we show that the 
estimation error is lower bounded by the inverse of the measurement noise power, which 
motivates the use of a simple selection method based on choosing the sensors with the 
lowest measurement noise. 

A subset of the above results was presented in an earlier conference paper ll25l . 

C. Organization 

The paper is organized as follows. Section UU describes the assumed system model. Section ITTll 
formulates the phase optimization problem and proposes a numerical solution based on SDP as 
well as a closed-form solution based on the algebraic constant modulus algorithm. In Section ITVl 
the asymptotic performance of the algorithm is analyzed for a large number of sensors and 
antennas. The effect of phase errors is analyzed in Section |V] and the sensor selection problem 
is investigated in Section |VIl Numerical results are then presented in Section IVIII and our 
conclusions can be found in Section IVIIIl 

II. System Model 

We assume that N single- antenna sensors in a distributed sensor network independently 
observe an unknown but deterministic parameter 9 E C according to the following model for 
sensor i: 

y i = 9 + Vi , 
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where is Gaussian observation noise with variance a^. The noise is assumed to be independent 
from sensor to sensor. Each sensor phase shifts its observation and transmits the signal to 
the FC, where |a$| = 1. Assuming a coherent MAC and an FC with M antennas, the vector 
signal received at the FC can be expressed as 

y = Ha# + HDv + n, (1) 

where H = [hi, . . . , hjv] and h, 6 c Mxl [ s the channel vector between the ith sensor and the 

FC, a = [ai, . . . , ajv] T contains the adjustable phase parameters, D = diagjai, . . . , oat}, v is the 

sensor measurement noise vector with covariance V = Ejvv^} = diag {o%i, • ■ • , &vn}> an d n 

is Gaussian noise at the FC with covariance E{nn^} = cr^I M , where 1 M is an M x M identity 

matrix. Note that since the sensors can only phase shift their observation prior to transmission, 

we ignore the issue of power control and assume that the sensors have sufficient power to forward 

their observation to the FC. 

The combined noise term HDv + n in © is Gaussian with covariance HVH H + a 2 n I, since 

DVD 5 = V due to the phase-only assumption. Assuming the FC is aware of the channel matrix 

H, the noise covariance V and <j 2 n , it can calculate the ML estimate of 9 using 11261 

~ a g H fl (HVH fl + ^I M )^y 

ML a«H«(HVH^ + a2l M )-iHa ' 

The estimator 9ml is unbiased with variance 

Var(^ /L ) = (a"H H (HVH H + ^I M )- 1 Ha)" 1 . (2) 

Furthermore, since ||a|| = iV when only phase shifts are used at the sensors, it is easy to see 
that the variance is lower bounded by 

Var( ^ /L) " ArA max (H-(HVH- + ^I A/ )-H) ' (3) 
where A max (-) denotes the largest eigenvalue of its matrix argument. Note that the bound in ([3]) is 
in general unachievable, since with probability one the given matrix will not have an eigenvector 
with unit modulus elements. 

III. Optimizing the Sensor Phase 

In this section we consider the problem of choosing a to minimize Var(^ MI ,) in ([2]). The 
unit modulus constraint prevents a trivial solution, but as we note below, a direct solution is not 
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possible even without this constraint since the noise covariance would then depend on a. The 
general optimization problem is formulated as 

min Vw(§ M l) (4) 

a 

S.t. = 1, % — 1, . . . , N . 

Defining B = H H (HVH H + ct^Ia/) _1 H, the problem can be rewritten as 

max a ff Ba (5) 

a 

s.t. \cii\ = 1, i = 1, . . . , N . 
If there are only two sensors in the network, a simple closed-form solution to © can be obtained. 

\ a be ^ 1 r -R -R i 

Defining B = with a, b, c > and a = [e Jlil , e 31 ^ 2 }, then a^Ba is calculated as 

b, c 

a fl Ba = a + c + 2b cos{f3 1 - - 0) 

< a + c + 2b, (6) 

and the equality in © can be achieved for any f3 2 that satisfy pi — f3 2 = ft. 

For the general situation where N > 2, a solution to © appears to be intractable. Instead, in the 
discussion that follows we present two suboptimal approaches in order to obtain an approximate 
solution. The first approach is based on an SDP problem obtained by relaxing a rank constraint 
in a reformulated version of ©. The second converts the problem to one that can be solved via 
the ACMA of ll24ll . It is worth emphasizing here that if the transmission gain of the sensors was 
also adjustable, then the corresponding problem would be 

max a H H ff (HDVD fl H Jf + ^I M )- 1 Ha (7) 

a 

s.t. a H a < N , 

which also has no closed-form solution due to the dependence on a (through the matrix D) 
inside the matrix inverse. While in general both problems require numerical optimizations, we 
will see in Sections IV- VI that the theoretical analysis of performance and the solution to the 
sensor selection problem is considerably simpler with the phase-only constraint. The simulations 
of Section VII will also demonstrate that there is often little performance loss incurred by using 
phase- shift-only transmissions. 
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A. SDP Formulation 

To begin, we rewrite © as follows: 



max 

A 

S.t. 



tr(BA) (8) 
A M = 1, i = 1,...,N 
rank(A) = 1 
A y , 

where A* ti denotes the ith diagonal element of A. We then relax the rank-one constraint, so that 
the problem becomes a standard SDP: 



max 

A 

S.t. 



tr(BA) 

A y o . 



(9) 



N 



Defining B r = real{B}, Bj = imag{B}, and similarly for A r and Aj, we can convert © to 
the equivalent real form 



max 

{A r ,Ai} 
S.t. 



tr(B r A r 



BjAj) 



(10) 



AT 



>- . 



A, A r 

Problem ((101) can be efficiently solved by a standard interior-point method H27I . 

In general, the solution to (flOl ) will not be rank one, so an additional step is necessary to 
estimate a. Let A*, A* denote the solution to problem (flOl) . then the solution to problem © is 
given by A* = A* + jA*. If rank(A*) > 1, we can use a method similar to Algorithm 2 in 
[|28l to extract a rank-one solution, as follows: 

1) Decompose^ A* = C H C, define B = CBC^, and find a unitary matrix U that can 
diagonalize B. 

2) Let r £ C Nxl be a random vector whose ith element is set to e JW % where Ui is uniformly 
distributed over [0,27r). 



'Since A* is the solution to problem l|9j, A* is positive semidefinite. 
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3) Set a = C H Ur, and the solution is given by a* = [a* ■ ■ ■ a* N } T , where a* = e jZai . 
A detailed discussion of the reasoning behind the above rank-one modification can be found in 

EEL 

B. ACMA Formulation 

For this discussion, we will assume that iV > M, which represents the most common scenario. 
Thus, the N x N matrix B in the quadratic form a H Ba that we are trying to maximize is low 
rank; in particular, rank(B) < M < N. Clearly, any component of a orthogonal to the columns 
or rows of B will not contribute to our goal of minimizing the estimate variance. In particular, 
if we define the singular value decomposition (SVD) B = USU H , we ideally seek a vector a 
such that 

rn 

a = ^ w k^k = U m w 

fe=i 

\<k\ = 1 , 

where U = [ui • ■ ■ u m ] contains the first m < rank(B) < M singular vectors of B and 
w = [w\ ■ ■ ■ w m ] T . The problem of finding the coefficient vector w of a linear combination of 
the columns of a given matrix U m that yields a vector with unit modulus elements is precisely 
the problem solved by the ACMA ll24l . 

Our problem is slightly different from the one considered in ll24l . since there will in general 
be no solution to (fTT|) even in the absence of noise. However, in our numerical results we will 
see that the ACMA solution provides performance close to that obtained by the SDP formulation 
above. Note also that there is a trade-off in the choice of m, the number of vectors in span(B) 
to include in the linear combination. A small value of m allows us to focus on forming a from 
vectors that will tend to increase the value of a H Ba, while a larger value for m provides more 
degrees of freedom in finding a vector whose elements satisfy |aj| = 1. Another drawback to 
choosing a larger value for m is that the ACMA solution can only be found if iV > m 2 . As 
long as M is not too large, one could in principle try all values of m = 1, • ■ ■ , M that satisfy 
N > m 2 and choose the one that yields the smallest estimate variance. We will see later in the 
simulations that a small value for m already provides good performance, so the choice of m is 
not a significant issue. 
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Since we are seeking only a single linear combination in our problem, a simplified version of 
ACMA can be used, as outlined in the following steps for a given m. Details on the reasoning 
behind the algorithm can be found in Il24ll . Note that in the algorithm below (■) denotes the 
complex conjugate. 

1) Define = [ui ■ • • un] and the following N x m 2 + 1 matrix: 



Ui 



UiJ -1 



(u N ®\i N ) H -1 

2) Let q represent the right singular vector of P associated with the smallest singular value, 
and define the vector q to contain the first m 2 elements of q. 

3) Set w equal to the singular vector of Q + with largest singular value, where the m x m 
matrix 

Q = vec _1 (q) (12) 



(11) 



is formed by dividing q into sub-vectors of length m and stacking them together in a 
matrix. 

4) Set a = U m w. Estimate the vector a by stripping away the magnitude of the elements of 



IV. Asymptotic Performance Analysis 

In this section, we analyze the asymptotic performance achievable using only phase-shifts for 
the sensor transmissions. We will separately study cases where the number of sensors is large 
(N — » oo) or the number of FC antennas is large (M — > oo). Our analysis will be based on an 
a non-fading channel model that takes path loss into account, similar to models used in ||29ll , 
||30l . In particular, for the channel between the FC and sensor i, we assume 

hj = — hi , 

df ' 

where di denotes the distance between the iih. sensor and the FC, a is the path loss exponent 
and hi is given by 

hi = [e^- 1 e j ^ 2 ■ ■ ■ e j ^ M ] T , 
where jij is uniformly distributed over [0, 2tt). 
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A. Estimation Performance for Large N 

From © we know that the lower bound on Vax(6 ML ) depends on the largest eigenvalue of 
H // (HVH H + cr^lAf) _1 H. We begin by deriving a lower bound for this eigenvalue. The (m, n)th 
element of HVH ff can be expressed as 



(HVH") = y 

V / m.n / 4 



i=i 1 

According to the strong law of large numbers, as N — > oo we have 

1 N pi(7 l ,m-7 l ,n) rr 2 ( a 2 \ 

1=1 * ^ 4 

(6) J E {^j m = n 
m n 

where (a) follows from the assumption that ji m , di and cx^ are independent and (b) is due 
to the fact that 7 i m and 7 i n are independent and uniformly distributed over [0, 2ir). Thus, for 
sufficiently large N we have 

HVH^iVE j^jl M . (14) 



(13) 



Based on (PT41) . we have 

lim A max (H if (HVH if + ^lM)- 1 H) 



i 



lim A max (H if H) 

JV— >oo 



ATE . , 

(c) I * 



(15) 

where (c) is due to the fact that A max (H H H) = A max (HH F ). Plugging (Q3) into ©, we have 
the following asymptotic lower bound on the estimate variance: 



Var(0 ML ) > 1 '/ , . (16) 

For large enough N, the lower bound can be approximated using sample averages: 

Y N <i + a 2 
Z-/i=l d?" ' n 

Z 

Next, we derive an upper bound on the estimate variance and compare it with the lower bound 
obtained above. The upper bound is obtained by calculating the variance obtained when only 



Var(fl ML ) > 77 ^ i . (17) 
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a single antenna is present at the FC. For the given channel model, the optimal choice for the 
vector of sensor phases is just the conjugate of the channel phases: a = [e -J7M ■ • ■ e -J ' 7jv > 1 ] T , 
which when applied to © leads to 

y N <i + a 2 

Var(fl ML ) < . (18) 

When N — > oo, both the upper and lower bounds converge to 0, but the ratio of the lower bound 
in (flTT) to the upper bound in ([TBI converges to 



(E£,*)' H»}) 2 M^} 



lim — — = ' ) ' '' = 1 . (19) 

™E,l l5 fc e{^} e{^} 

Interestingly, we see that if Var { ^ j ^ E j ^p- 1 , the gap between the upper and lower bound 
is very small, and the availability of multiple antennas at the FC does not provide much benefit 
compared with the single antenna system when N — > oo. On the other hand, if Var j^j — > 
E j i >, the potential exists for multiple antennas to significantly lower the estimate variance. 



B. Estimation Performance for Large M 
Using the matrix inversion lemma, we have 



i-M) xi = xi |— i M - , - , „■ 

■ n n \ n 



= ^rH^H - ^H^H ( V- 1 + -^-H^h) H^H . (20) 

< < \ < ) 

Furthermore, the (m, n)th element of H H H is given by 

1 M 

(H H B) = e i(7n,i-7 m ,i) . (21) 

V / m,n Ja / j ■ 

u m u 'n i=l 

Similar to (fl"3l ), as M — > oo we have 

1 M ( , 

1 \ 1 ra — n 

lim — y eJ(Tn,i-Tv»,<) = ; (22) 



i=l 



and thus 



to oo H«H=Mdia € { 5 L... 5 L}. (23) 
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Substituting d23) into (1201) . we have 

M M 



lim H H (HVH^ + ^I M )- 1 H = diag 



" > nM > ta ] « + M<, ' ' d^ol + Mal N 

and thus 

lim Var(0 Mi ) = 1 . (24) 



Note that this asymptotic expression is independent of the choice of a. Here, for large M, 
the benefit of having multiple antennas at the FC hinges on the relative magnitude of Ma1 i 
versus dfV^. If M<j^ oi <C d\ a o\, a reduction in variance by a factor of M is possible. In this 
case, where the SNR at the FC is low but the signals sent from the sensors are high quality, 
the coherent gain from the combination of the relatively noise-free sensor signals helps increase 
the SNR at the FC. On the other hand, when Ma'l i 3> d 2a a 2 , performance is asymptotically 
independent of M. Here, the coherent gain not only applies to 9 but also to the sensor noise, 
which is stronger in this case. 

V. Impact of Imperfect Phase 

The previous sections have assumed that the FC can calculate the vector a and feed the phase 
information back to the sensors error free. Whether the feedback channel is digital or analog, 
there are about to be errors either in the received feedback at the sensors or in how the phase 
shift is actually implemented. Furthermore, the wireless channel may change during the time 
required for calculation and feedback of a, so even if the phase shifts are implemented perfectly 
at the sensors, they may no longer be valid for the current channel. In this section, we evaluate 
the impact of errors in the sensor phase shifts on the estimation accuracy. 

Define the phase shift for the ith sensor as = e ja \ and assume that 

on = a* + A* , 

where a* is the optimal phase and Aj is a Gaussian perturbation (in radians) with zero mean 
and variance a 2 p . Define E = H H (HVH H + a^Vj^K so that Var(6 ML ) can be expressed as 

Var(6 ML ) = J = 1 , (25) 
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where e$ is the ith column of E. Let e^-e^ be a polar coordinate representation of the jth 
element of e { , so that 



|a H e,-| 2 



M 



i=i 

M 



M M 



^ e?j + ^ ^ e^e^ cos(a,* + A t + fa - a* m - A m - /3 n 



(26) 



=1 m=l 



Define ^ m = A z - A m and r/ m = a ; * + $ - a* m - /3 m . If we assume a* < 1, ([26]) may be 
approximated via a 2nd order Taylor series as follows: 

cos(r? m ) 2 \ 



W N N ( 

la^e^ 2 E e L" + E E e *V e i,m cos 
j=l ;=i m =i, V 



Km) " sin Km)^,, 



(4, 



N N N N N I 

E e L + E E e M e *,m C0S Km)~E E e M e *,m (sin(Y/ m )^ 
j=l ;=1 m=l, f=l m=l, V 



cos(r? m ) , . , 2 \ 



Substituting d27j) into (|25i we have 



Var(9 



ML) 



EililEjLi e ?j+E^i Em=i eue*,m cosK , m )-E£i Em=i e l ,/e l , m ( 



ys ( T tm.) 



Lm 



In the previous equation, the effect of the phase error is confined to the second double sum 
inside the outermost parentheses. If we define 6^ IL to be the estimate obtained with no phase 
errors, then 

1 



Var(6 ML ) 



(28) 



and we have the following approximation 



/ 



(/) 

Var{6 ML ) « Far(0 



ml) 



2^i=i 1^1=1 l^m=i e i: ie i:m I sin(r, )o l H 5 — (d, J 

\ m^Z \ 

E£i (E^Li efj+EjIi Empi e M e ijm cos(r^ m )J 



1 + 



where (/) is due to the first order Taylor approximation (1 — 2) 1 m 1 + | for i < ?/. We use 
the ratio of ^ar(^ A / L ) to Kar(#^ L ) to measure the effect of the phase error, which yields 



Var(6 



ML 



Var{6 p ML ) 



^ Eii (e£i E^i <w« (MrJ J*U + 0U 2>I ^ A 



V 



/ 
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Taking the expectation of the ratio with respect to the phase perturbations A i5 we have 



E ■ 



Var(9 ML ) 
Var(0P 



ML' 



1 + 












) 






; + EiIlE™=l e i,J e i,mCos(r/ ) 


1 



EM tt-^N i-^N / i \ 1 

<=i E/=i Lm=l e M e,, m cos(r/ m )cr; 

771.7= i 



Ei=i Ej=i e f,i+Ei=i Em=i ei,/e i)m cos(r/ ) 



/ 

(29) 



where in (g) we exploit the fact that E {5\ m } = and E j (<^ m ) 2 } = 2<Xp. Since 

iV Af AT 



Z=l m=l 



1=1 



the ratio in (|29l ) is approximately upper bounded by 



E 




< 1 



1 



(30) 



We see from (1301) that the impact of the phase errors increases with N, but in all cases the 
degradation in the estimate variance is approximately bounded above by a factor of 1 + oy 

VI. Sensor Selection 

As mentioned earlier, in situations where it is desired to use only a subset of the sensors to 
estimate the parameter (e.g., in order to conserve power at the sensors), the FC needs a method 
to perform the sensor selection. Assuming only K < N of the sensors are to be selected for 
transmission to the FC, an optimal solution to the problem would require solving the following 
maximization: 



max 

a,x 



x T D H H^ (HVXH H + a 2 n l M ) 1 HDx 



(3D 



S.t. 



i=i 

Xi = {0,1} 
\Oi\ = 1 , 

where D = diag {ai, • • • , a N }, x = [x\, ■ ■ ■ , x N } T is the selection vector and X = diagjxi, • • • , x N }. 
Even if one chooses one of the suboptimal approaches described in Section III for estimating a, 
solving for x in (|3TT) requires an exhaustive search over all possible fT-sensor combinations and 
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is in general NP-hard. Instead, in this section we derive conditions under which much simpler 
selection strategies can be applied. We consider the following two cases: (1) low sensor noise 
relative to the noise at the FC, a^i ^ a n' an d (2) relatively high sensor noise cr^ i a^. For 
(1), we derive a LP solution as well as a simpler greedy algorithm, and for (2) we show that 
the problem reduces to choosing the sensors with the lowest measurement noise. 

A. Algorithms for High FC Noise 

Let a be the phase vector obtained using one of the algorithms in Section III assuming all 
N sensors are active. When a'l i <C of t , we ignore the term HVXH^ in (l3l"l) . and the problem 
simplifies to 



x 

N 
i=l 

x t = {0,1} 



max x T D fl H ff HDx (32) 

X 

N 

i.t. Xi = K 



Define F = D H H H HD so that ([32]) can be rewritten as 

max x T Re{F}x (33) 

X 

N 

s.t. Xi = K 

i=i 

Xl = {0,1} . 

Since x\ = Xi, d33l is equivalent to 

TV N-1 N 

max ^Fi^Xi + 2^2 ^ ReiFijjxiXj (34) 



i=l i=l j=i+l 

N 

i=l 

x t = {0,1} 



Xi = K 
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where denotes the (i, j)th element of matrix F. By linearizing the term x t xj 11311 , (1341 ) is 
equivalent to 



N N-l N 



max ^ F hi x i + 2 Rc l F ',/ 1 .'/-;, (35a) 

i=l i=l j=i+l 

N 

s.t. s ^x i = K (35b) 

i=l 



1 - Xi - xj + yij > (35c) 

Xi - Vij > (35d) 

xj - Vij > (35e) 

Vis > (35f) 

^ = {0, 1} , (35g) 



where the constraints (|35cb -( [35g| ) lead to 



Note that all of the constraints in (1331 ) are linear, except for (|35g[). If we relax the constraint 



in (|35g[), the condition < Xi < 1 is implicitly included in (|36cl) - (|36fl) . and we are left with a 



LP problem in standard form [|3T| : 

N N-l N 

max ^ F iti Xi + 2 Re{P ij -}y i j (36a) 



i=l i=l ]=i+l 

N 



s.t.^Xi = K (36b) 



i=i 



1 — Xi — Xj + yij > (36c) 

Xi - Vij > (36d) 

Xj - Vij > (36e) 

Vij > . (36f) 

To find the Xj = {0, 1} solution needed for sensor selection, one can take the result of (l36l) and 
simply set the K largest elements to one and the rest to zero. If desired, once the K sensors have 
been selected, the phase vector a for these K sensors can be recomputed based on a reduced 
dimension version of the algorithms in Section III. 
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The above LP problem has Af ^~ 1 - > + iV variables and 2N(N — 1) + 1 constraints, and thus 
will require on the order of ^ N ~^ + n) {2N{N - 1) + 1) arithmetic operations (271. A 
simpler greedy algorithm is presented below that only requires O(KN) operations, and that 
achieves performance close to the LP approach. The greedy algorithm is based on the following 
observation: 

K K 

x T D H H iJ HDx = J2J2® ia i h ? h i 

i=i j=i 

K-lK-l (K-l 

— ^ ^2 ®i a jhfhj + ||hx|| 2 + 2Re < ^ a K ajh^hj 

i=l j=l [j=l 

The idea behind the greedy algorithm is to add sensors one at a time based on those for which 
the last two terms in the above sum are the largest. The steps of the algorithm are detailed below. 

Greedy Sensor Selection Algorithm 

1) Select the first sensor as the one with the strongest channel: i = argmax fc ||h fe || 2 , and 
initialize the active sensor set as S = {i} . 

2) While |<S| < K, perform the following: 

a) Solve 

i 



argmax ||hfc|| 2 + 2Re < akdjh^hj > . 
HS [jes J 



b) Update S = S \J i . 

As with the LP algorithm, once the K sensors are selected, an updated solution for the associated 
K elements of a can be obtained. 

B. Algorithm for High Sensor Noise 

When ex 2 • ^> a\ and assuming that N > M (the case of interest when sensor selection is 
necessary), the original criterion can be simplified to 

a ff H fl (HVH ff ) _1 Ha = a ff V^ V^H^ (HVH ff ) ^ HV^ V^a 

= a fl V^P ra V^a , 

where ~P VH = is a rank M projection matrix. Ideally, to maximize 

the criterion function, one should attempt to find a vector of the form V~aa that lies in the 
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subspace defined by Pvh- Assuming the vector a can approximately achieve this goal, the lower 
bound on variance is approximately achieved and we have 

! „ 1 = 1 (37) 

a»V-i Pl .„V-ia a»V-.a E f_ 1 ^_ ' 

With respect to the sensor selection problem, this suggests that when a 2 , the K sensors 

with the smallest values of a 2 u { should be chosen. 

VII. Simulation Results 

Here we present the results of several simulation examples to illustrate the performance of 
the proposed algorithms. In all cases, the path loss exponent a was set to 1, and each result is 
obtained by averaging over 300 channel realizations. The sensors are assumed to lie in a plane 
at random angles with respect to the FC, uniformly distributed over [0,2tt). The distances of 
the sensors to the FC will be specified separately below. To evaluate the performance without 
feedback, a is set to a vector of all ones. In some of the simulations, we will compare the 
performance of the proposed algorithms with that obtained by ©, where both the sensor gain 
and phase can be adjusted. In these simulations, we use the active-set method to optimize ©, 
and we use several different initializations in order to have a better chance of obtaining the 
global optimum. When the ACMA algorithm is implemented, the subspace dimension was set 
at m = 2. 

In the first two examples, we study the estimation performance for M = 4 FC antennas with 
increasing N for a case where the sensor measurement noise a1 i is uniformly distributed over 
[0.01, 0.1] and the FC noise a\ is set to 0.1. Fig. Q] shows the results assuming that the sensor 
distances di are uniformly distributed in the interval [3,20], while in Fig. |2] di = 11.5 for all 
sensors. In both cases, even though the lower bound of © is not achievable, we see that the 
performance of the proposed SDP and ACMA methods is nonetheless reasonably close to the 
bound, and not significantly worse than the performance obtained by optimizing both the phase 
and gain. As N gets larger in Fig. [Q the estimation error for all of the methods (except the 
no-feedback case) falls within the asymptotic lower and upper bounds of (fTTT) and (fl~8l) . When 
N = 50, the ratio Var |^r| /E |^| is 0.304 for Fig. [Q and the ratio between the lower and 
upper bound is 0.702, which is in excellent agreement with the value of 1 — 0.304 predicted by 
Eq. (fl9i Since the upper bound in (fl"8l) corresponds to the case of M = 1, one may suppose that 
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the gap in Fig. Q] between the bounds of (fTTT ) and (TT8T ) indicates that the presence of multiple 
antennas at the FC could provide a benefit for large N. However, the performance of SDP 
and ACMA is approaching the upper bound more tightly, indicating that there is no benefit 
from having multiple antennas in this case. In Fig. [2] where the are all equal, the asymptotic 
bounds in (fTTT) and (fT8l) are identical, and asymptotically we expect no benefit from multiple 
antennas at the FC. We see again that for large iV the performance of the SDP and ACMA 

2 

methods is essentially at the predicted bound. When the di are equal and -^r <C the matrix 

i 

H fl (HVH H + cr^I M ) _1 H asymptotically approaches a scaled identity matrix, so in this case 
the performance of the proposed phase-shift only algorithms even approaches the lower bound 
of Eq. ©. 

Fig. |3] illustrates the performance for iV = 4 with an increasing number of FC antennas M 
when o'l i is uniformly distributed over [0.001, 0.01] and o\ = 0.1. In this example, for most of 
the sensors we have Ma^ <C dj a a^, so in this case we see an improvement as the number of 
FC antennas increases. However, the benefit of optimizing the transmit phase (and gain for that 
matter) is reduced as M increases. 

In Fig. HI we investigate the effect of phase errors for two cases, = 0.1 and = 0.2 
assuming the same noise parameter settings as in the first two examples. For each channel 
realization, results for 3000 different phase error realizations were obtained and averaged to 
obtain the given plot. The ratio of the variance obtained by the SDP algorithm with and without 
phase errors is plotted for M = 2, 4, 6 for both values of ai, and the approximate bound of (|30l) 
is also shown. The results show that the performance degradation increases with N, and that (|30l) 
provides a reasonable indication of performance for large N. Fig. |4] also shows that increasing 
the number of FC antennas improves the robustness of the algorithm to imprecise sensor phase. 

In Fig. |51 we compare the performance of the three different sensor selection algorithms 
discussed in the paper (LP, greedy and min-sensor-noise) as a function of a\ assuming M = 4 
antennas, N = 35 sensors and the sensor noise is uniformly distributed over [0.001,0.01]. The 
sensor distances dj are uniformly distributed in the interval [3,20]. Three sets of curves are 
plotted, one for K = 5 selected sensors, one for K = 10, and one corresponding to when all 
the sensor nodes are used (the solid curve, obtained using the SDP algorithm). After the sensor 
selection, the proposed SDP is used to re-optimize the selected sensor nodes' phase parameters. 
For small cr^ such that al i a\, we see as predicted that the best performance is obtained by 
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simply selecting the K sensors with the smaller measurement noise. On the other hand, again in 
agreement with our analysis, the LP and greedy algorithms achieve the lowest estimation error 
for larger values of a 2 . Interestingly, the greedy algorithm provides performance essentially 
identical to the LP approach at a significantly reduced computational cost. 

VIII. Conclusions 

In this paper, we investigated a distributed network of single antenna sensors employing a 
phase-shift and forward strategy for sending their noisy parameter observations to a multi-antenna 
FC. We presented two algorithms for finding the sensor phase shifts that minimize the variance of 
the estimated parameter, one based on a relaxed SDP and a closed-form heuristic algorithm based 
on the ACMA approach. We analyzed the asymptotic performance of the phase-shift and forward 
scheme for both large numbers of sensors and FC antennas, and we derived conditions under 
which increasing the number of FC antennas will significantly benefit the estimation performance. 
We also analyzed the performance degradation that results when sensor phase errors of variance 
a 2 are present, and we showed that for large N the variance will approximately increase by 
a factor of 1 + a 2 provided that a 2 <C 1 square radian. The sensor selection problem was 
studied assuming either low or high sensor noise with respect to the noise at the FC. For low 
sensor noise, two algorithms were proposed, one based on linear programming with a relaxed 
integer constraint, and a computationally simpler greedy approach. For high sensor noise, we 
showed that choosing the sensors with the smallest noise variances was approximately optimal. 
Simulation studies of the proposed algorithms illustrate their advantages and the validity of the 
asymptotic analyses. 
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Fig. 1. Performance of the proposed algorithms with an increasing number of sensors for a low measurement noise scenario 
(crj; = 0.1, a% j uniformly distributed over [0.01, 0.1], di uniformly distributed over [3, 20] and M = 4). 




Fig. 2. Performance of the proposed algorithms with an increasing number of sensors for a low measurement noise scenario 
(al = 0.1, cr^j uniformly distributed over [0.01, 0.1], d, = 11.5 and M = 4). 
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Fig. 3. Performance of the proposed algorithms with an increasing number of antennas (a^ = 0.1, a'i A uniformly distributed 
over [0.001, 0.01], di uniformly distributed over [3, 20] and N = 4). 
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Fig. 4. Effect of phase errors on algorithm performance (crj; = 0.1, o% 4 uniformly distributed over [0.01, 0.1] and di uniformly 
distributed over [3, 20]). 
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Fig. 5. Performance comparison between different sensor selection algorithms (N — 35, M = 4, o'i i uniformly distributed 
over [0.001,0.01] and di uniformly distributed over [3,20]). 
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